Learning Long-Range Vision for an Offroad Robot
نویسنده
چکیده
Teaching a robot to perceive and navigate in an unstructured natural world is a difficult task. Without learning, navigation systems are short-range and extremely limited. With learning, the robot can be taught to classify terrain at longer distances, but these classifiers can be fragile as well, leading to extremely conservative planning. A robust, high-level learning-based perception system for a mobile robot needs to continually learn and adapt as it explores new environments. To do this, a strong feature representation is necessary that can encode meaningful, discriminative patterns as well as invariance to irrelevant transformations. A simple realtime classifier can then be trained on those features to predict the traversability of the current terrain. One such method for learning a feature representation is discussed in detail in this work. Dimensionality reduction by learning an invariant mapping (DrLIM) is a weakly supervised method for learning a similarity measure over a domain. Given a set of training samples and their pairwise relationships, which can be arbitrarily defined, DrLIM can be used to learn a function that is invariant to complex transformations of the inputs such as shape distortion and rotation. The main contribution of this work is a self-supervised learning process for long-range vision that is able to accurately classify complex terrain, permitting improved strategic planning. As a mobile robot moves through offroad environments, it learns traversability from a stereo obstacle detector. The learning architecture is composed of a static feature extractor, trained offline for a general yet discriminative feature representation, and an adaptive online classifier. This architecture reduces the effect of concept drift by allowing the online classifier to quickly adapt to very few training samples without overtraining. After experiments with several different learned feature extractors, we conclude that unsupervised or weakly supervised learning methods are
منابع مشابه
Machine Learning Applied to Terrain Classification for Autonomous Mobile Robot Navigation
We work on the Stanford AI Lab team for the DARPA-funded Learning Applied to Ground Robotics (LAGR) project. Each of the eight competing teams in this program write code for a standardized robot platform equipped with short-range sensors and two pairs of stereo cameras. The long term focus of the program is the advancement of the state-of-the-art in computer vision and offroad autonomous mobile...
متن کاملA Multi-range Vision Strategy for Autonomous Offroad Navigation
Vision-based navigation and obstacle detection must be sophisticated in order to perform well in complicated and diverse terrain, but that complexity comes at the expense of increased system latency between image capture and actuator signals. Increased latency, or a longer control loop, degrades the reactivity of the robot. We present a navigational framework that uses a self-supervised, learni...
متن کاملOnline Learning for Offroad Robots: Spatial Label Propagation to Learn Long-Range Traversability
We present a solution to the problem of long-range obstacle/path recognition in autonomous robots. The system uses sparse traversability information from a stereo module to train a classifier online. The trained classifier can then predict the traversability of the entire scene. A distance-normalized image pyramid makes it possible to efficiently train on each frame seen by the robot, using lar...
متن کاملVisual Terrain Traversability Estimation Using a Combined Slope/Elevation Model
A stereo vision based terrain traversability estimation method for offroad mobile robots is presented. The method models surrounding terrain using either sloped planes or a digital elevation model, based on the availability of suitable input data. This combination of two surface modeling techniques increases range and information content of the resulting terrain map.
متن کاملRobot Motion Vision Part II: Implementation
The idea of Fixation introduced a direct method for general recovery of shape and motion from images without using either feature correspondence or optical flow [1,2]. There are some parameters which have important effects on the performance of fixation method. However, the theory of fixation does not say anything about the autonomous and correct choice of those parameters. This paper presents ...
متن کامل